Effective balancing error and user effort in interactive handwriting recognition

نویسندگان

  • Nicolás Serrano
  • Jorge Civera
  • Alberto Sanchís
  • Alfons Juan-Císcar
چکیده

Transcription of handwritten text documents is an expensive and timeconsuming task. Unfortunately, the accuracy of current state-of-the-art handwriting recognition systems cannot guarantee fully-automatic high quality transcriptions, so we need to revert to the computer assisted approach. Although this approach reduces the user effort needed to transcribe a given document, the transcription of handwriting text documents still requires complete manual supervision. An especially appealing scenario is the interactive transcription of handwriting documents, in which the user defines the amount of errors that can be tolerated in the final transcribed document. Under this scenario, the transcription of a handwriting text document could be obtained efficiently, supervising only a certain number of incorrectly recognised words. In this work, we develop a new method for predicting the error rate in a block of automatically recognised words, and estimate how much effort is required to correct a transcription to a certain user-defined error rate. The proposed method is included in an interactive approach to tranTel: (+34) 963877350 + 73533 Fax: (+34) 963877359 Email address: {nserrano,jcivera,josanna,ajuan}@dsic.upv.es (N. Serrano, J. Civera, A. Sanchis and A. Juan) Preprint submitted to Pattern Recognition Letters May 18, 2015 scribing handwritten text documents, which efficiently employs user interactions by means of active and semi-supervised learning techniques, along with a hypothesis recomputation algorithm based on constrained Viterbi search. Transcription results, in terms of trade-off between user effort and transcription accuracy, are reported for two real handwritten documents, and prove the effectiveness of the proposed approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining Neural Networks and Context-Driven Search for On-line, Printed Handwriting Recognition in the Newton

MESSAGEPAD and EMATE. Combining an artificial neural network (ANN) as a character classifier with a context-driven search over segmentation and word-recognition hypotheses provides an effective recognition system. Long-standing issues relative to training, generalization, segmentation, models of context, probabilistic formalisms, and so on, need to be resolved, however, to achieve excellent per...

متن کامل

Error Repair in Human Handwriting - An Intelligent User Interface for Automatic On-Line Handwriting Recognition

Several important factors, such as recognition accuracy, user acceptance, and system usability, have to be considered in designing an interface of a handwriting recognition system. Since both users and recognition algorithms make mistakes, it is desirable for the user interface of a handwriting recognition system to have mechanisms recovering from errors. In this paper we address the problem of...

متن کامل

Confidence Measures for Error Correction in Interactive Transcription Handwritten Text

An effective approach to transcribe old text documents is to follow an interactive-predictive paradigm in which both, the system is guided by the human supervisor, and the supervisor is assisted by the system to complete the transcription task as efficiently as possible. In this paper, we focus on a particular system prototype called GIDOC, which can be seen as a first attempt to provide user-f...

متن کامل

Context-Aware Gestures for Mixed-Initiative Text Editing UIs

This work is focused on enhancing highly interactive text-editing applications with gestures. Concretely, we study CATTI, a handwriting transcription system that follows a corrective feedback paradigm, where both the user and the system collaborate efficiently to produce a high-quality text transcription. CATTI-like applications demand fast and accurate gesture recognition, for which we observe...

متن کامل

Preprocessing and Feature Extraction Techniques for Multimodal Interactive Transcription of Text Images

To date, automatic handwriting recognition systems are far from being perfect and heavy human intervention is often required to check and correct the results of such systems. This “post-editing” process is both inefficient and uncomfortable to the user. An example is the transcription of historic documents: State-of-the-art handwritten text recognition technology is not suitable to perform this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Pattern Recognition Letters

دوره 37  شماره 

صفحات  -

تاریخ انتشار 2014